RAW SEQUENCE LISTING 
ERROR REPORT 

be6t available copy 



BIOTECHls^LOGY 
SYSTEMS 
BRANCH 




/ 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form:_^ 

Application Serial Number: 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datin21help@.uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help(%uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organi2ation (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/888,182 

Input Set : A:\ES,txt 

Output Set: N:\CRF3\11012001\I888182.raw 



DATE: 11/01/2001 
TIME: 14:15:17 



Does Not Comply 
Corrected Diskette Needed 



ERRORED SEQUEIi! 




W- 



W- 
W- 
W- 
W- 



<210> SEQ ID NO: 1 

<211> LENGTH: 27 
<212> TYPE: DNA 
<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Erk5-specif ic 

primer 
<110> APPLICANT: 
<120> TITLE INVENTION: 
<i60> NUMBER OF SEQ ID NOS: 
<400> SEQUENCE: 1 



10 cagccattcg atgtgggccc acgcta 



26 





file://C:\Crf3\Outhold\VsrI888182.htm 
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VERIFICATION SUMMARY DATE: 11/01/2001 

PATENT APPLICATION: US/09/888,182 TIME: 14:15:18 

* Input Set : A:\ES.txt 

Output Set: N:\CRF3\11012001\I888182.raw 

L:2 M:283 W: Missing Blank Line separator, <210> field identifier 

L:6 M:283 W: Missing Blank Line separator, <220> field identifier 

L:0 M:282 W: Numeric Field Identifier Missing, <110> is required. 

L:0 M:282 W: Numeric Field Identifier Missing, <120> is required. 

L:0 M:282 W: Numeric Field Identifier Missing, <160> is required. 

L:9 M:283 W: Missing Blank Line separator, <400> field identifier 

L:10 M:252 E: No. of Seq . differs, <211>LENGTH : Input : 27 Found:26 SEQ:1 

L:16 M:283 W: Missing Blank Line separator, <220> field identifier 

L:19 M:283 W: Missing Blank Line separator, <400> field identifier 

L:26 M:283 W: Missing Blank Line separator, <220> field identifier 

L:30 M:283 W: Missing Blank Line separator, <400> field identifier 

L:31 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:37 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:41 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# ; 3 

L:42 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:43 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:44 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:50 M:283 W: Missing Blank Line separator, <220> field identifier 

L:54 M:283 W: Missing Blank Line separator, <220> field identifier 

L:58 M:283 W: Missing Blank Line separator, <400> field identifier 

L:59 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:60 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:61 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:62 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:65 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:78 M:283 W: Missing Blank Line separator, <220> field identifier 

L:82 M:283 W: Missing Blank Line separator, <400> field identifier 

L:83 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:85 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:88 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:99 M:283 W: Missing Blank Line separator, <220> field identifier 

L:106 M:283 W: Missing Blank Line separator, <400> field identifier 

L:107 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:108 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:109 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:110 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:lll M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:112 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:113 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 

L:0 M:203 E: No. of Seq. differs, <160> Number Of Sequences : Input (0) Counted (6) 
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€EQUEN e £ LISTING 




<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Erk5-specif ic 

primer 
<400> 1 

cagccattcg atgtgggccc acgcta 26 



<210> 3 

<211> 802 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 
<222> (1) . . (38) 

<223> partial sequence of expn 2 from Erk5 gene 

cgg^cctac tgtgccctat ggaggaattc agatctgtgt aagggagtgg gccaggagga 60 Jj^l^ ^ 
ggagacacag tcgggatcag cttagaagcc caggttcagt aatactgaag ttctggcagg 120 ^^/t^u/^y 
gcggttgaac ccagagtgat gcgggctgtg agtccaggac attggtaggg acagttctta 180 * 





Use of n and/or Xaa has been deteaed in the Sequence Listing. 
Review the Sequence Listing to insure a corresponding 
explanation is presented in the <220> to <223> fields of 
eacli sequence using n or Xaa* 




<110> Smith. John; Smithgcnc Inc. 

<120> Example of a Sequence Listing 



thdrJj^^^^^Jy 01-00001 

u <1<0> PCT/EP98/00001 

<\Cl> 1998-12-31 



<150> US 08/999,999 

<151> 1997-10-15 

<1C0> < ' ' 

<170> Patcntin version 2.0 

<2 10> 1 
<21 1> 389 

< 2 1 2 > DNA 

<2 13> ('Paramecium sp. 

<220> 

<22 1> CDS . 

< 2 2 2> (279) . (389) 

<300> 

<301> Doe. Richard 

< 302 > Isolation and Ctia ractcr i za t i on of a Ccnc Encoding a 

Protease from Paramecium sp . 

<303> Journal of Genes 

<304> 1 

<30S> 4 

<30G> 1-7 

<307> 1988-0G-31 

<308> 1234SG 

<309> 1988-06-31 

<400> 1 

agctgLagtc attcctgtgr cctcttctcc ctgggcutcL caccccgcta atcagatccc 60 

'A • ■ ^' 

agggagagcg tcttgacccc; cctccgcctc cgcagcttco caggcaggcxi ggcaggcagc 120 

cgaugtggca actgccggca gtgccacagg cttcccagcc aggcttaggg cgggccccgc lOO 

cgcggcgcgg cggcccctct cgcgcccctc tcgcgcctcc cccccgctcc cctctcgccc 240 



PClfM/f:Xi:C/22/6 
Annex VII. page 30 



Appendix 3, page 2 

ggacctoatt aggtgagcag gaggaggggg cagttagc acg get tea acg tec age 29C 

Met Val Scr Met Phc Scr 

1 5 

ttg tec ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg tee caa 3< 

Leu scr Phc Lys . Trp Pro Gly Phc Cys Leu Phc Val Cys Leu Phc Cln 

iO 15 20 



tgt eec aaa gtc ete ccc tgt cae tea tea etg cag ccg . aat ett ^ 399 

Cys Pro Lys -Val Leu Pro Cys Mis Scr Scr •4.cu Gin Pro Asn-_:±3«u-- 
25 30 \' 35 

<210> 2 

<211> 3? 

<212> PRT ' ^ 

<213> Paramecium sp. 

<<00> 2 

Met Val Scr Net Phe Scr Leu Scr Phc Lys Trp Pro Gly Phc. Cys Leu 

1 . s 10 IS 

Phc Val Cys Leu Phe Cln Cys Pro Lys Val Leu Pro Cys His Scr Scr 
20 25 30 

Leu Cln Pro Asn Lou 

<7\0> 3 

< 2 1 1 > 11 

<212> PKT 

<213> Artificial Sequence 
<220> 

<223> Designed peptide hosed on size and polarity to acc as 0 
linker between the alpha and beta chains of Protein XYZ. 

<400> 3 

Met val Asn Leu Clu Pro Met His Tlir Clu lie 

I S 10 

<210> 4 

<400> 4 
000 



(Annex VIII follows) 
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cable. The numeric identifier shall be used only in Che Sequence 
Listing." The order and presentation of the items of information in the 
••Sequence Listing*' shall conform to the arrangement given below. Each 
item of information shall begin on a new line and shall begin with the 
numeric identifier enclosed in angle brackets as shown. The submission 
of those items of information designated with an •'M" is mandatory. The 
subrrvission of those items of information designated with an "O" is 
optional. Numeric identifiers <110> through <170> shall only, be set 
forth at the beginning of the "Sequence Listing." Tpe following table 
illustrat-es the numeric identifiers. 



Numeric 
Identi f ier 



Definition 



Comments and 
Fo rma t 



Mandatory (M) or 
Optional .(O) 



<110> 



Applicant 



Preferably max. 

of 10 names; 

one name per line; 

preferable .format: 

Surname, Other/- 

Names and/or 

Initials 



M 



<120> 



Title of 
Invention 



M 



<130> 



Pile Reference 



Personal file 
rc f c rcncc 



M^. when filed prior 
to assignment of 
appl . .num):)cr 



<1^,0> 



Current Applica- 
tion Number 



Specify as: 

US 07/999, 999 or 

PCT/US96/99999 



M- if a va i labl c 



<14 1> 



Current Filirig 
Date 



Specify as: yyyy-mm-dd M, if available 



<150> 



Prior Application Specify as: 
Number US 07/999,999 or 

PCT/US96/99999 



M, if appi icable 
include priority 
documents under 
35 use 119 and 
120 



<151> 



Prior Application Specify as: yyyy-mm-dd M, if applicable 
filing Date 



<1G0> 



Number of SEQ ID Count includes 



NOs 



total number of 
SEQ ID NOs 



<170> 



Sof twa re . 



Name of software used 
to create the 
Sequence Listing 



<210> 



SEQ ID NO: « : 



Response shall be an 
integer repre- 
senting the SEQ 
ID NO shown 



<211> 



Length 



Respond with an integer M 
expressing the number 
of bases or amino acid 
res idues 



1/29^99 I S) VM 



iiuiictiicniv lor A(»plic.it.oMi - OC l):iic. ?J iunc \00Z 



<212> 



Type 



<213> 



Organism 



• v/hcchcr presented 
sequence mole- 
cule is DNA, 
RNA, or PRT 
(protein) . I f 
a nucleotide 
sequence con- 
tains both DNA 
and RNA frag- 
ments, the 
type shall be 
"DNA. " In ad- 
dition, the 
combined DNA/ 
fU^A molecule 
shall be further 
described in 
the <220> to 
<223> feature 
section. /- 

Scientific name, 
i.e. Gcnus/spccics , 
Unknown or Artifi- 
cial Sequence. In 
addition, the 
"Unknown" or 
"Artificial Se- 
quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 



M 



<220> 



Pea turc 



Leave blank after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence . 



M, under the 
following condi- 
tions : i f "n, " 
"Xaa , " or a mod- 
i f i cd or unusua 1 
L-amino acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Artifi- 
cial Sequence" or 
"Unknown" ; i f 
molecule is 
combined DNA/RNA. 



<221> 



Name/ Key 



Provide appropriate 
ident i f ier for 
f ea t u r e , pre- 
ferably f rom 
WIPO Standard 
ST.2S (1998), 
Appe nd i X 2 , 
Tables 5 and 6 



M, undc r the f ol - 
lowing conditions: 
if "n , "Xaa , " or 
a mod ified or un- 
usual L-amino 
acid or modified 
base was used in 
a sequence 



<222> 



Loca t i on 



Spec i f y loca t ion 
within sequence; 
where appropriate 
state number of 
first and last 
bases /ami no acids 



M, under the fol- 
lowing conditions: 
if "n, " "Xaa , *' or 
a modified or un- 
usua 1 L-amino ^ 
acid or mod i f i ed 



l/2t>/99 I Si f*M 



<223> 



other Infor- 
mation 



•itK qf .tictti 



in feature 



Other relevant 

information ; 

four lines maximum 



base was used in 
a sequence 

M, under the f ol - 
lowing conditions: 
if "n, " "Xaa, or 
a modified or un- 
usual L-aruino acid 
or modified base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence*' or . 
"Unknown"; .i^.^=A 
molecule is com- 
bined DNA/RNA. 



<300> 



Publ ication 
Inf orma tion 



Leave blank 
after <300> 



O 



<301> 



Authors 



Preferably max 
of ten named 
authors of publi- 
cation; specify 
one name per line; 
preferable format : 
Surname, Other 
Names and/or 
Initials 



<302> 
<303> 
<304> 
<305> 
<30G> 
<307> 



< 300> 



Title 

Jou rna 1 

Volume 

Issue 

Pages 

Date 



Da t aba sc 
/\cccss i on 
Numbe r 



Journal date on which 
data published; 
specify as yyyy-mm- 
dd, r<MM-yyyy or 
Season - y yy y 

Accession number 
assigned by data- 
base including 
database name 



O 



O 



O 



O 



<309> 



<310> 



Database Entry 
Date 



Patent Document 
Numbe r 



Date of entry in 
database; specify 
as yyyy-mm-dd or 
MMM- yyyy 

Document number; 
for patent-type 
citations only. 
Spec i f y a s , for 
example, US 
07/999. 999 



<31 1> 



<312> 



Patent Filing 
Date 



Document filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 



Publication Date Document publication O • 
date, for 
patent-type 
citations only; 
specify as yyyy-mm-dd; 

'<313> Relevant FROM (position) TO O 

Residues (position) 

<400> Sequence SEQ ID NO should ' . ;M 

follow the 
numeric identifier 
and should appear 
on tl»c line prc- 
ceding the actual • 
sequence . - 

5. Section 1.024 is revised to read as follows: 

1.024 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 



(a) The computer readable form required by 
following specifications: 



1.021 (c) s)ial 1 mccL the 



(1) The computer rcadablt? form shall contain a single "Scqucticc LisLirig" 
as cither a diskette, scries of diskettes, or o the r pc rmi s s ibl c media 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listing*', in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII) 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any (neans, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer programs; however, it shall conform to all specifications 
detailed in t)iis section. 



(4) File compression is acceptable when using diskette media, so long as 
the compressed file is in a se 1 f -ex t ract ing format that will decompress 
on one of t)ie systems described in paragraph (b) of tliis section. 

(5) Page num)>cring shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a. label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on whicli the data were 
recorded on the computer readable form, the operating system used, a 
reference numl^er. and an application serial number and filing date, if 
known . 

(b) Computer readable form submissions must meet these format 
requ i remcn t s : 

(I) Computer: I DM PC/XT/AT. or compatibles, or Apple Macintosh; 



(2» Operating System: MS-DOS, Unix or Macintosh; 



